This is where the final project report write-up goes.

Before you submit, make sure everything runs as expected.

You can add sections as you see fit. Make sure you have a section called “Introduction” at the beginning and a section called “Conclusion” at the end. The rest is up to you!

##Introduction - Load the tidyverse, ggplot, and rtweet packages

rr library(tidyverse) library(ggplot2) library(rtweet) library(readr)

This data set was scraped from WineEnthusiast, a website that reviews and rates many differet types of wines.

wines <- read.csv(file = '../data/processed_data/wines.csv')
cannot open file '../data/processed_data/wines.csv': No such file or directoryError in file(file, "rt") : cannot open the connection

rr set.seed(19630217) wine_sample<- sample_n(wines, 1000)

EDA (correlation priceXpoints, with DataExplorer library? using (this)[https://datascienceplus.com/blazing-fast-eda-in-r-with-dataexplorer/])

rr wines %>% ggplot() + geom_point(mapping = (aes(x = points, y = price)), na.rm = T)

rr wines %>% summarize(mean(price, na.rm=TRUE), min(price, na.rm=TRUE), max(price,na.rm=TRUE), sd(price, na.rm=TRUE))

rr wines %>% summarize(mean(points, na.rm=TRUE), min(points, na.rm=TRUE), max(points,na.rm=TRUE), sd(points, na.rm=TRUE))

Select the provinces based on points and Select the best province for wine based on the average points of the sample size.

#find the average number of points across the 1,000 samples

rr wine_per_province <- wine_sample %>% select(province, points) %>% summarise(points = mean(points)) wine_per_province

#Find the best province for wine using the average points across the 1,000 samples #drop the descriptions or just select price? set points to max(points)

rr best_province <- wine_sample %>% group_by(province, points) %>% filter(points > 88.669) best_province

Rating distribution

Best wine, by variety

rr #sort by price, then points #want to do an interaction variableor somethin? #wine_cheap_but_good <- wines %>% group_by(variety) %>% summarise(mean_points = mean(points)) %>% arrange(desc(mean_points)) r NA NA

rr user_price <- readline(prompt = much are you willing to spend on a bottle?) 17 user_price <- as.integer(user_price)

#best_cheap_wine <- wines %>% filter(price <= user_price) %>% arrange(desc(points)) %>% select(title, price, points)

##Conclusion

LS0tCnRpdGxlOiAiRmluYWwgUmVwb3J0IGZvciBJbnRybyB0byBEYXRhIFNjaWVuY2UiCm91dHB1dDogaHRtbF9ub3RlYm9vawotLS0KVGhpcyBpcyB3aGVyZSB0aGUgZmluYWwgcHJvamVjdCByZXBvcnQgd3JpdGUtdXAgZ29lcy4gCgpCZWZvcmUgeW91IHN1Ym1pdCwgbWFrZSBzdXJlIGV2ZXJ5dGhpbmcgcnVucyBhcyBleHBlY3RlZC4KCllvdSBjYW4gYWRkIHNlY3Rpb25zIGFzIHlvdSBzZWUgZml0LiBNYWtlIHN1cmUgeW91IGhhdmUgYSBzZWN0aW9uIGNhbGxlZCAiSW50cm9kdWN0aW9uIiBhdCB0aGUgYmVnaW5uaW5nIGFuZCBhIHNlY3Rpb24gY2FsbGVkICJDb25jbHVzaW9uIiBhdCB0aGUgZW5kLiBUaGUgcmVzdCBpcyB1cCB0byB5b3UhCgoKCiMjSW50cm9kdWN0aW9uCi0gTG9hZCB0aGUgYHRpZHl2ZXJzZSwgZ2dwbG90LCBhbmQgcnR3ZWV0YCBwYWNrYWdlcwpgYGB7ciwgbWVzc2FnZT1GQUxTRSwgd2FybmluZz1GQUxTRX0KbGlicmFyeSh0aWR5dmVyc2UpCmxpYnJhcnkoZ2dwbG90MikKbGlicmFyeShydHdlZXQpCmxpYnJhcnkocmVhZHIpCmBgYAoKClRoaXMgZGF0YSBzZXQgd2FzIHNjcmFwZWQgZnJvbSBXaW5lRW50aHVzaWFzdCwgYSB3ZWJzaXRlIHRoYXQgcmV2aWV3cyBhbmQgcmF0ZXMgbWFueSBkaWZmZXJldCB0eXBlcyBvZiB3aW5lcy4gIAoKLSBUaGlzIGRhdGFzZXQgaW5jbHVkZXMgaW5mb3JtYXRpb24gb2Ygb2YgMTMwLDAwMCB3aW5lIHJldmlld3Mgd2l0aCAxMCBkaWZmZXJlbnQgZGF0YSBmaWVsZHMuIAogIAoKYGBge3J9CndpbmVzIDwtIHJlYWQuY3N2KGZpbGUgPSAnLi4vZGF0YS9wcm9jZXNzZWRfZGF0YS93aW5lcy5jc3YnKQpgYGAKCmBgYHtyfQpzZXQuc2VlZCgxOTYzMDIxNykKd2luZV9zYW1wbGU8LSBzYW1wbGVfbih3aW5lcywgMTAwMCkKYGBgCgpFREEgKGNvcnJlbGF0aW9uIHByaWNlWHBvaW50cywgd2l0aCBgYGBEYXRhRXhwbG9yZXJgYGAgbGlicmFyeT8gdXNpbmcgKHRoaXMpW2h0dHBzOi8vZGF0YXNjaWVuY2VwbHVzLmNvbS9ibGF6aW5nLWZhc3QtZWRhLWluLXItd2l0aC1kYXRhZXhwbG9yZXIvXSkKYGBge3J9CndpbmVzICU+JSAKICBnZ3Bsb3QoKSArCiAgICBnZW9tX3BvaW50KG1hcHBpbmcgPSAoYWVzKHggPSBwb2ludHMsIHkgPSBwcmljZSkpLCBuYS5ybSA9IFQpCmBgYAoKYGBge3J9CndpbmVzICU+JQogICAgc3VtbWFyaXplKG1lYW4ocHJpY2UsIG5hLnJtPVRSVUUpLCAKICAgICAgICAgICAgICBtaW4ocHJpY2UsIG5hLnJtPVRSVUUpLAogICAgICAgICAgICAgIG1heChwcmljZSxuYS5ybT1UUlVFKSwgCiAgICAgICAgICAgICAgc2QocHJpY2UsIG5hLnJtPVRSVUUpKQpgYGAKCmBgYHtyfQp3aW5lcyAlPiUKICAgIHN1bW1hcml6ZShtZWFuKHBvaW50cywgbmEucm09VFJVRSksIAogICAgICAgICAgICAgIG1pbihwb2ludHMsIG5hLnJtPVRSVUUpLAogICAgICAgICAgICAgIG1heChwb2ludHMsbmEucm09VFJVRSksIAogICAgICAgICAgICAgIHNkKHBvaW50cywgbmEucm09VFJVRSkpCmBgYAoKU2VsZWN0IHRoZSBwcm92aW5jZXMgYmFzZWQgb24gcG9pbnRzICBhbmQgU2VsZWN0IHRoZSBiZXN0IHByb3ZpbmNlIGZvciB3aW5lIGJhc2VkIG9uIHRoZSBhdmVyYWdlIHBvaW50cyBvZiB0aGUgc2FtcGxlIHNpemUuIAoKI2ZpbmQgdGhlIGF2ZXJhZ2UgbnVtYmVyIG9mIHBvaW50cyBhY3Jvc3MgdGhlIDEsMDAwIHNhbXBsZXMKYGBge3J9CndpbmVfcGVyX3Byb3ZpbmNlIDwtIHdpbmVfc2FtcGxlICU+JSAKICBzZWxlY3QocHJvdmluY2UsIHBvaW50cykgJT4lIAogIHN1bW1hcmlzZShwb2ludHMgPSBtZWFuKHBvaW50cykpCndpbmVfcGVyX3Byb3ZpbmNlCmBgYAoKCiNGaW5kIHRoZSBiZXN0IHByb3ZpbmNlIGZvciB3aW5lIHVzaW5nIHRoZSBhdmVyYWdlIHBvaW50cyBhY3Jvc3MgdGhlIDEsMDAwIHNhbXBsZXMKI2Ryb3AgdGhlIGRlc2NyaXB0aW9ucyBvciBqdXN0IHNlbGVjdCBwcmljZT8gc2V0IHBvaW50cyB0byBtYXgocG9pbnRzKQpgYGB7cn0KYmVzdF9wcm92aW5jZSA8LSB3aW5lX3NhbXBsZSAlPiUgCiAgZ3JvdXBfYnkocHJvdmluY2UsIHBvaW50cykgJT4lIAogIGZpbHRlcihwb2ludHMgPiA4OC42NjkpCmJlc3RfcHJvdmluY2UgIApgYGAKCgpSYXRpbmcgZGlzdHJpYnV0aW9uCgpgYGB7cn0KCmBgYAoKQmVzdCB3aW5lLCBieSB2YXJpZXR5CmBgYHtyfQojd2luZV9iZXN0X3ZhcmlldHkgPC0gCndpbmVzICU+JSAKICBncm91cF9ieSh2YXJpZXR5KSAlPiUgCiAgc3VtbWFyaXNlKG1lYW5fcG9pbnRzID0gbWVhbihwb2ludHMpKSAlPiUgCiAgYXJyYW5nZShkZXNjKG1lYW5fcG9pbnRzKSkgCiAgCmBgYAoKYGBge3J9CnVzZXJfcHJpY2UgPC0gcmVhZGxpbmUocHJvbXB0ID0gIkhvdyBtdWNoIGFyZSB5b3Ugd2lsbGluZyB0byBzcGVuZCBvbiBhIGJvdHRsZT8iKQp1c2VyX3ByaWNlIDwtIGFzLmludGVnZXIodXNlcl9wcmljZSkKCndpbmVzICU+JSAKICBmaWx0ZXIocHJpY2UgPD0gdXNlcl9wcmljZSkgJT4lIAogIGFycmFuZ2UoZGVzYyhwb2ludHMpKSAlPiUgCiAgc2VsZWN0KHRpdGxlLCBwcmljZSwgcG9pbnRzKQpgYGAKCgoKIyNDb25jbHVzaW9uCg==